Mosclust: a software library for discovering significant structures in bio-molecular data
Similar resources
Mosclust: a software library for discovering significant structures in bio-molecular data
The R package mosclust (model order selection for clustering problems) implements algorithms based on the concept of stability for discovering significant structures in bio-molecular data. The software library provides stability indices obtained through different data perturbation methods (resampling, random projections, noise injection), as well as statistical tests to assess the s...
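The stability idea in the abstract above can be illustrated with a small sketch. This is not the mosclust API (mosclust is an R package, and none of its functions are shown here); it is a minimal Python illustration that assumes resampling as the perturbation, a plain k-means as the clustering algorithm, and a pairwise Jaccard coefficient as the similarity between two clusterings. The names `kmeans`, `jaccard`, and `stability` are hypothetical helpers introduced only for this example.

```python
import random
from itertools import combinations

def kmeans(points, k, rng, iters=20):
    """Plain Lloyd's k-means on a list of coordinate tuples; returns one label per point."""
    centers = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest center (squared Euclidean distance).
        labels = [min(range(k),
                      key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
                  for p in points]
        # Move each center to the mean of its members; keep it if the cluster is empty.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centers[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels

def jaccard(labels_a, labels_b):
    """Jaccard similarity between the 'same cluster' pair relations of two labelings."""
    both = either = 0
    for i, j in combinations(range(len(labels_a)), 2):
        a = labels_a[i] == labels_a[j]
        b = labels_b[i] == labels_b[j]
        both += a and b
        either += a or b
    return both / either if either else 1.0

def stability(points, k, n_pairs=10, frac=0.8, seed=0):
    """Mean similarity of clusterings computed on pairs of random subsamples (assumed scheme)."""
    rng = random.Random(seed)
    n = len(points)
    m = int(frac * n)
    scores = []
    for _ in range(n_pairs):
        idx_a = rng.sample(range(n), m)
        idx_b = rng.sample(range(n), m)
        lab_a = dict(zip(idx_a, kmeans([points[i] for i in idx_a], k, rng)))
        lab_b = dict(zip(idx_b, kmeans([points[i] for i in idx_b], k, rng)))
        # Compare the two clusterings only on the points both subsamples contain.
        shared = sorted(set(idx_a) & set(idx_b))
        scores.append(jaccard([lab_a[i] for i in shared], [lab_b[i] for i in shared]))
    return sum(scores) / len(scores)
```

Under these assumptions, a number of clusters that matches real structure in the data should give a score close to 1, while a poorly matched number tends to give lower and more variable scores. Deciding whether such a difference is significant is the role of the statistical tests the abstract mentions, which this sketch does not attempt.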
Discovering significant structures in clustered data through Bernstein inequality
The reliability of clusters discovered by a given clustering algorithm may be estimated by means of methods based on the concept of stability with respect to "random perturbations" of the data. In this context, a major problem is to estimate the confidence of the measures of reliability; recently proposed procedures realizing this task are correct under the assumption that some probability dist...
DiSTiL: A Transformation Library for Data Structures
DiSTiL is a software generator that implements a declarative domain-specific language (DSL) for container data structures. DiSTiL is a representative of a new approach to domain-specific language implementation. Instead of being the usual one-of-a-kind standalone compiler, DiSTiL is an extension library for the Intentional Programming (IP) transformation system (currently under development by M...
A Platform Based on the Multi-dimensional Data Model for Analysis of Bio-Molecular Structures
Discovering statistically significant biclusters in gene expression data
In gene expression data, a bicluster is a subset of the genes exhibiting consistent patterns over a subset of the conditions. We propose a new method to detect significant biclusters in large expression datasets. Our approach is graph theoretic coupled with statistical modelling of the data. Under plausible assumptions, our algorithm is polynomial and is guaranteed to find the most significant ...
Journal
Journal title: Bioinformatics
Year: 2006
ISSN: 1367-4803,1460-2059
DOI: 10.1093/bioinformatics/btl600